PARIKH MATRICES AND PARIKH REWRITING SYSTEMS


WEN CHEAN TEH 


Abstract. Since the introduction of the Parikh matrix mapping, its injectivity
problem has been at the top of the list of open problems on this topic. In 2010
Salomaa provided a solution for the ternary alphabet in terms of a Thue
system with an additional feature called a counter. This paper proposes the
notion of a Parikh rewriting system as a generalization and systematization
of Salomaa's result. It will be shown that every Parikh rewriting system
induces a Thue system without counters that serves as a feasible solution to
the injectivity problem.


1. Introduction 

The Parikh matrix mapping was originally introduced as an extension of
the Parikh mapping. Parikh matrices provide more structural information
about words than Parikh vectors and are a useful tool in studying subword
occurrences. Two words are M-equivalent iff they have the same Parikh matrix. The
characterization of M-equivalence, also known as the injectivity problem, and
the closely related ambiguity problem have garnered the most interest among
researchers in this area [1, 7, 9, 13-20]. A complete characterization for the case
of the binary alphabet has been obtained. However, the case of the ternary
alphabet proves to be elusive.

The first complete characterization was provided by Salomaa [13] in terms
of a Thue system. Nevertheless, the associated rewriting rules do not preserve
M-equivalence. To compensate, a counter is attached to the Thue system
to keep track of the quantity that decides whether the resulting word is
M-equivalent to the original word. Later Atanasiu [3] proposed a wider class
of M-equivalence preserving rewriting rules for the ternary alphabet, based on
Salomaa's Thue system.

This paper introduces the concept of a Parikh rewriting system. It generalizes
the above Thue system proposed by Salomaa. Then it will be shown that the
"irreducible" transformations extracted from a Parikh rewriting system induce
a Thue system that characterizes M-equivalence.

The remainder of this paper is structured as follows. Section 2 provides the
basic terminology and preliminaries. Section 3 systematizes the characterization
of M-equivalence in terms of Thue systems. The main object of our study,
namely the Parikh rewriting system, will be introduced in Section 4. Irreducible
transformations of a Parikh rewriting system will be studied in the subsequent
section. The next two sections involve some case studies for the binary and the
ternary alphabet. Our conclusions follow after that.

2000 Mathematics Subject Classification. 68R15, 68Q45, 05A05.

Key words and phrases. Parikh matrix, subword, injectivity problem, Parikh rewriting
system, M-equivalence.


2. Parikh Matrices 


We will assume the reader is familiar with the basic notions of formal
languages. The reader is referred to [12] if necessary.

Suppose $\Sigma$ is a finite alphabet. The set of words over $\Sigma$ is denoted by $\Sigma^*$.
The empty word is denoted by $\lambda$. Let $\Sigma^+$ denote the set of nonempty words
over $\Sigma$. If $v, w \in \Sigma^*$, the concatenation of $v$ and $w$ is denoted by $vw$. An
ordered alphabet is an alphabet $\Sigma = \{a_1, a_2, \ldots, a_s\}$ with a total ordering on it.
For example, if $a_1 < a_2 < \cdots < a_s$, then we may write $\Sigma = \{a_1 < a_2 < \cdots < a_s\}$.

On the other hand, if $\Sigma = \{a_1 < a_2 < \cdots < a_s\}$ is an ordered alphabet, then the
underlying alphabet is $\{a_1, a_2, \ldots, a_s\}$. Frequently, we will abuse notation and
use $\Sigma$ to stand for both the ordered alphabet and its underlying alphabet, for
example, as in "$w \in \Sigma^*$" when $\Sigma$ is an ordered alphabet. If $w \in \Sigma^*$, then $|w|$ is
the length of $w$ and $w[i]$ is the $i$-th letter of $w$. For $1 \le i \le j \le s$, let $a_{i,j}$ denote
the word $a_i a_{i+1} \cdots a_j$. Suppose $\Gamma \subseteq \Sigma$. The projective morphism
$\pi_\Gamma : \Sigma^* \to \Gamma^*$ is defined by

$$\pi_\Gamma(a) = \begin{cases} a, & \text{if } a \in \Gamma \\ \lambda, & \text{otherwise.} \end{cases}$$

We may write $\pi_{a,b}$ for $\pi_{\{a,b\}}$.

Definition 2.1. A word $w'$ is a subword of $w \in \Sigma^*$ iff there exist $x_1, x_2, \ldots, x_n$,
$y_0, y_1, \ldots, y_n \in \Sigma^*$, some of them possibly empty, such that

$$w' = x_1 x_2 \cdots x_n \quad\text{and}\quad w = y_0 x_1 y_1 \cdots y_{n-1} x_n y_n.$$

In the literature, our subwords are usually called "scattered subwords". A
factor is a contiguous subword. The number of occurrences of a word $u$ as a
subword of $w$ is denoted by $|w|_u$. Two occurrences of $u$ are considered different
iff they differ by at least one position of some letter. For example, $|aabab|_{ab} = 5$
and $|baacbc|_{abc} = 2$. By convention, $|w|_\lambda = 1$ for all $w \in \Sigma^*$.
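
The counting convention above can be checked mechanically. The following is a minimal sketch in Python, not part of the original development; the function name `count_subword` is our own choice. It counts scattered-subword occurrences by dynamic programming and reproduces the two values quoted above.

```python
def count_subword(w: str, u: str) -> int:
    """Return |w|_u, the number of occurrences of u as a scattered subword of w."""
    # dp[k] = number of ways to embed the prefix u[:k] into the letters of w read so far
    dp = [1] + [0] * len(u)
    for letter in w:
        # Walk k downwards so that one letter of w extends each embedding at most once.
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


print(count_subword("aabab", "ab"))    # 5
print(count_subword("baacbc", "abc"))  # 2
print(count_subword("abc", ""))        # 1, matching the convention for the empty word
```

The table `dp` has one entry per prefix of $u$, so the running time is proportional to $|w| \cdot |u|$.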

For any integer $k \ge 2$, let $M_k$ denote the multiplicative monoid of $k \times k$ upper
triangular matrices with nonnegative integral entries and unit diagonal.

Definition 2.2. Suppose $\Sigma = \{a_1 < a_2 < \cdots < a_s\}$ is an ordered alphabet. The
Parikh matrix mapping, denoted $\Psi_\Sigma$, is the monoid morphism

$$\Psi_\Sigma : \Sigma^* \to M_{s+1}$$

defined as follows:

if $\Psi_\Sigma(a_q) = (m_{i,j})_{1 \le i,j \le s+1}$, then $m_{i,i} = 1$ for each $1 \le i \le s+1$, $m_{q,q+1} = 1$ and all
other entries of the matrix $\Psi_\Sigma(a_q)$ are zero. Matrices of the form $\Psi_\Sigma(w)$ for
$w \in \Sigma^*$ are called Parikh matrices.

Theorem 2.3. Suppose $\Sigma = \{a_1 < a_2 < \cdots < a_s\}$ is an ordered alphabet and
$w \in \Sigma^*$. The matrix $\Psi_\Sigma(w) = (m_{i,j})_{1 \le i,j \le s+1}$ has the following properties:

• $m_{i,i} = 1$ for each $1 \le i \le s+1$;

• $m_{i,j} = 0$ for each $1 \le j < i \le s+1$;

• $m_{i,j+1} = |w|_{a_{i,j}}$ for each $1 \le i \le j \le s$.

The Parikh vector $\Psi(w) = (|w|_{a_1}, \ldots, |w|_{a_s})$ of a word $w \in \Sigma^*$ is contained
in the second diagonal¹ of the Parikh matrix $\Psi_\Sigma(w)$.

Example 2.4. Suppose $\Sigma = \{a < b < c\}$ and $w = abcbac$. Then

$$\Psi_\Sigma(w) = \Psi_\Sigma(a)\Psi_\Sigma(b)\Psi_\Sigma(c)\Psi_\Sigma(b)\Psi_\Sigma(a)\Psi_\Sigma(c) = \begin{pmatrix} 1 & 2 & 2 & 3 \\ 0 & 1 & 2 & 3 \\ 0 & 0 & 1 & 2 \\ 0 & 0 & 0 & 1 \end{pmatrix}.$$

Definition 2.5. Suppose $\Sigma = \{a_1 < a_2 < \cdots < a_s\}$ is an ordered alphabet. Two
words $w, w' \in \Sigma^*$ are M-equivalent, denoted $w \equiv_M w'$, iff $\Psi_\Sigma(w) = \Psi_\Sigma(w')$.
A word $w \in \Sigma^*$ is M-ambiguous iff there exists some distinct word $w' \in \Sigma^*$
M-equivalent to $w$. Otherwise, $w$ is said to be M-unambiguous.
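
For readers who wish to experiment, here is a small Python sketch of Definition 2.2 and Definition 2.5. The names `parikh_matrix` and `m_equivalent` are our own, and the ordered alphabet is passed as a string listing its letters in increasing order, which is an assumption of this sketch. It evaluates the matrix product letter by letter, using the fact that right-multiplying by $\Psi_\Sigma(a_q)$ adds column $q$ to column $q+1$.

```python
def parikh_matrix(w: str, alphabet: str) -> list:
    """Parikh matrix of w over the ordered alphabet given as a string, e.g. "abc"."""
    s = len(alphabet)
    m = [[int(i == j) for j in range(s + 1)] for i in range(s + 1)]  # identity matrix
    for letter in w:
        q = alphabet.index(letter)
        # Right-multiplication by Psi(a_q): add column q to column q + 1 (0-indexed).
        for row in m:
            row[q + 1] += row[q]
    return m


def m_equivalent(w: str, v: str, alphabet: str) -> bool:
    """True iff w and v have the same Parikh matrix."""
    return parikh_matrix(w, alphabet) == parikh_matrix(v, alphabet)


for row in parikh_matrix("abcbac", "abc"):
    print(row)
print(m_equivalent("abba", "baab", "ab"))  # True
```

The first row of the printed matrix reads 1, 2, 2, 3, which agrees with Theorem 2.3: $|w|_a = 2$, $|w|_{ab} = 2$ and $|w|_{abc} = 3$ for $w = abcbac$.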

3. Thue Systems in Relation to M-Equivalences

Characterization of M-equivalence in terms of Thue systems has been carried
out in earlier work. However, a systematic treatment of this has not appeared
in the literature. To us a Thue system is defined as follows.

Definition 3.1. A Thue system is an ordered pair $(\Sigma, R)$, where $\Sigma$ is an ordered
alphabet² and $\emptyset \ne R \subseteq \{(y, y') \mid y, y' \in \Sigma^* \text{ and } y \ne y'\}$. Each element $(y, y')$
of $R$ is associated with two rewriting rules of the Thue system, which are
suggestively denoted by $y \to y'$ and $y' \to y$. Suppose $w, w' \in \Sigma^*$. We say that $w$
transforms directly into $w'$, denoted $w \to_R w'$, using the rewriting rule $y \to y'$
iff

$$w = xyz \text{ and } w' = xy'z \text{ for some } x, z \in \Sigma^*.$$

We say $w$ transforms into $w'$, denoted $w \Rightarrow_R w'$, iff there exist $w_0, w_1, \ldots, w_n \in \Sigma^*$
such that

$$w = w_0 \to_R w_1 \to_R w_2 \to_R \cdots \to_R w_n = w'.$$

By definition, $w \Rightarrow_R w$ for every $w \in \Sigma^*$.

Note that $\Rightarrow_R$ is an equivalence relation on $\Sigma^*$. The corresponding equivalence
class containing the word $w \in \Sigma^*$ will be denoted by $[w]_R$.

Definition 3.2. A Thue system $(\Sigma, R)$ is Parikh sound iff $w \equiv_M w'$ for every
$w, w' \in \Sigma^*$ such that $w \Rightarrow_R w'$. A Thue system $(\Sigma, R)$ is Parikh complete iff
$w \Rightarrow_R w'$ for every $w, w' \in \Sigma^*$ such that $w \equiv_M w'$.

¹The second diagonal of a matrix in $M_{k+1}$ is the diagonal of length $k$ immediately above
the principal diagonal.

²Although unnecessary in this definition, the alphabet is assumed to be ordered out of
convenience and compatibility with the next definition.

A Thue system $(\Sigma, R)$ is Parikh sound and Parikh complete if and only if
the relation $\Rightarrow_R$ coincides with the M-equivalence relation $\equiv_M$.

Proposition 3.3. Suppose $(\Sigma, R)$ is a Thue system. If $y \equiv_M y'$ for every
$(y, y') \in R$, then $(\Sigma, R)$ is Parikh sound.

Proof. The proof is straightforward using the left-invariance, right-invariance,
and transitivity of M-equivalence. □

Example 3.4. Suppose $\Sigma$ is any ordered alphabet and $R = \{(w, w') \in \Sigma^* \times \Sigma^* \mid
w \equiv_M w' \text{ and } w \ne w'\}$. Then the Thue system $(\Sigma, R)$ is trivially Parikh sound
and Parikh complete.

Example 3.5. Suppose $\Sigma = \{a < b\}$ and $R = \{(abxba, baxab) \mid x \in \Sigma^*\}$.
Then the Thue system $(\Sigma, R)$ is Parikh sound and Parikh complete.

Example 3.6. Suppose $\Sigma = \{a < b < c\}$ and

$$R = \{(ac, ca)\} \cup \{(abxba, baxab) \mid x \in \{a, b\}^*\} \cup \{(bcxcb, cbxbc) \mid x \in \{b, c\}^*\}.$$

Then the Thue system $(\Sigma, R)$ is Parikh sound but not Parikh complete. The
two words $abbcbacb$ and $bacbabbc$ are M-equivalent but $abbcbacb \not\Rightarrow_R bacbabbc$.

This Thue system $(\Sigma, R)$ was studied in [1]. The fact that it is not Parikh
complete was pointed out in [16], where the counterexample provided is the
pair of words $babcbabcbabcbab$ and $bbacabbcabbcbba$, each with fifteen letters.
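
The eight-letter pair in Example 3.6 is small enough to verify by exhaustive search. The sketch below is only an illustration; the helper names `neighbours` and `equivalence_class` are ours. It enumerates the class $[abbcbacb]_R$ by breadth-first search, since all rules preserve length and the class is therefore finite.

```python
from collections import deque


def count_subword(w, u):
    """|w|_u by dynamic programming over prefixes of u."""
    dp = [1] + [0] * len(u)
    for letter in w:
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


def neighbours(w):
    """All words obtained from w by one rewriting rule of the system in Example 3.6."""
    out, n = set(), len(w)
    for i in range(n - 1):                      # ac <-> ca
        if w[i:i + 2] in ("ac", "ca"):
            out.add(w[:i] + w[i + 1] + w[i] + w[i + 2:])
    # abxba <-> baxab with x over {a,b}; bcxcb <-> cbxbc with x over {b,c}
    for p, q, forbidden in (("ab", "ba", "c"), ("bc", "cb", "a")):
        for i in range(n - 1):
            for j in range(i + 2, n - 1):
                left, right, x = w[i:i + 2], w[j:j + 2], w[i + 2:j]
                if {left, right} == {p, q} and forbidden not in x:
                    out.add(w[:i] + right + x + left + w[j + 2:])
    return out


def equivalence_class(w):
    """The class [w]_R, computed by breadth-first search."""
    seen, queue = {w}, deque([w])
    while queue:
        for v in neighbours(queue.popleft()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


u, v = "abbcbacb", "bacbabbc"
print(all(count_subword(u, t) == count_subword(v, t)
          for t in ("a", "b", "c", "ab", "bc", "abc")))  # True: M-equivalent
print(sorted(equivalence_class(u)))                       # v is not in this class
```

By Theorem 2.3, comparing the six subword counts is the same as comparing the Parikh matrices over the ternary alphabet.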

Theorem 3.7. Suppose $\Sigma$ is an ordered alphabet with $|\Sigma| \ge 2$. No Parikh sound
Thue system $(\Sigma, R)$ with $R$ finite is Parikh complete.

Proof. Suppose $\Sigma = \{a < b < \cdots\}$ and $(\Sigma, R)$ is Parikh sound with $R$ finite. Let
$N$ be an integer at least the length of every word appearing in some rewriting
rule of the Thue system. Let

$$w = ab\underbrace{a \cdots a}_{N-1 \text{ times}}ba \quad\text{and}\quad w' = ba\underbrace{a \cdots a}_{N-1 \text{ times}}ab.$$

Then $w$ and $w'$ are M-equivalent. However, every factor of $w$ with length at most
$N$ is M-unambiguous. Since $(\Sigma, R)$ is Parikh sound, every word appearing
in its rewriting rules is M-ambiguous. Hence, no rewriting rule of the Thue
system can be applied to $w$. Therefore, $w \not\Rightarrow_R w'$ and thus $(\Sigma, R)$ is not Parikh
complete. □


4. Parikh Rewriting Systems 

Now, we introduce our main object of study. 

Definition 4.1. A Parikh rewriting system is a triple $\mathfrak{P} = (\Sigma, R, C)$, where
$\Sigma = \{a_1 < a_2 < \cdots < a_s\}$ is an ordered alphabet of size at least two, $(\Sigma, R)$
is a Thue system such that each rewriting rule preserves the Parikh vector,
and $C \subseteq \{a_{i,j} \mid 1 \le i < j \le s\}$. The elements of $C$ are called the counters.
Suppose $w, w' \in \Sigma^*$. We say that $w$ transforms into $w'$, denoted $w \Rightarrow_{\mathfrak{P}} w'$, iff
$|w|_c = |w'|_c$ for all $c \in C$ and $w \Rightarrow_R w'$. Two Parikh rewriting systems $\mathfrak{P}$ and $\mathfrak{Q}$
are equivalent iff $\Rightarrow_{\mathfrak{P}} = \Rightarrow_{\mathfrak{Q}}$.

Note that $\Rightarrow_{\mathfrak{P}}$ is an equivalence relation on $\Sigma^*$.

Definition 4.2. Suppose $\mathfrak{P} = (\Sigma, R, C)$ is a Parikh rewriting system.

(1) Two words $w, w' \in \Sigma^*$ are $\mathfrak{P}$-equivalent iff $w \Rightarrow_{\mathfrak{P}} w'$.

(2) A word $w \in \Sigma^*$ is $\mathfrak{P}$-ambiguous iff there exists some distinct word $w' \in \Sigma^*$
$\mathfrak{P}$-equivalent to $w$. Otherwise, $w$ is said to be $\mathfrak{P}$-unambiguous.

It is not absolutely obvious that $\mathfrak{P}$-ambiguous words must exist for every
Parikh rewriting system $\mathfrak{P}$.

Lemma 4.3. Suppose $\Sigma$ is any alphabet and suppose $w_1, w_1' \in \Sigma^*$ are distinct
and $\Psi(w_1) = \Psi(w_1')$. Recursively, define $w_{n+1} = w_n w_n'$ and $w_{n+1}' = w_n' w_n$. Then
for every positive integer $n$, the words $w_n$ and $w_n'$ are distinct and $|w_n|_u = |w_n'|_u$
for every $u \in \Sigma^*$ such that $|u| \le n$.

Remark 4.4. Lemma 4.3 can be proved easily by induction on $n$. The above
lemma is implicit in the proof of $f(t) < 2^t$ that appeared in [8], where $f(t)$ is the
maximal length such that any word of length at most $f(t)$ over any alphabet
of size at least two is uniquely determined by its $t$-spectrum.
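
Lemma 4.3 is easy to test on small instances. The following Python sketch is illustrative only (`lemma_words` is a name we introduce here). It builds $w_n$ and $w_n'$ from the seed pair $w_1 = ab$, $w_1' = ba$ and compares all subword counts up to length $n$.

```python
from itertools import product


def count_subword(w, u):
    """|w|_u by dynamic programming over prefixes of u."""
    dp = [1] + [0] * len(u)
    for letter in w:
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


def lemma_words(w1, w1_prime, n):
    """Return (w_n, w_n') following the recursion of Lemma 4.3."""
    w, wp = w1, w1_prime
    for _ in range(n - 1):
        w, wp = w + wp, wp + w
    return w, wp


n = 3
w, wp = lemma_words("ab", "ba", n)
print(w, wp)  # abbabaab baababba
agree = all(
    count_subword(w, "".join(u)) == count_subword(wp, "".join(u))
    for k in range(n + 1)
    for u in product("ab", repeat=k)
)
print(agree)  # True: all subwords of length at most 3 occur equally often
```

With this seed pair the recursion produces prefixes of the Thue-Morse word and their complements.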

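Proposition 4.5. $\mathfrak{P}$-ambiguous words exist for any Parikh rewriting system
$\mathfrak{P} = (\Sigma, R, C)$.

Proof. Recall that every rewriting rule of $\mathfrak{P}$ preserves the Parikh vector.
Suppose $(w, w') \in R$. By our definition, $w \ne w'$. Let $w_1 = w$ and $w_1' = w'$.
For every $2 \le n \le |\Sigma|$, define $w_n$ and $w_n'$ as in Lemma 4.3. We claim that
$w_{|\Sigma|} \Rightarrow_{\mathfrak{P}} w_{|\Sigma|}'$. Since $|u| \le |\Sigma|$ for every $u \in C$, by Lemma 4.3, it suffices to
show that $w_{|\Sigma|} \Rightarrow_R w_{|\Sigma|}'$. This can be proved easily by induction on $n$. Clearly,
$w = w_1 \Rightarrow_R w_1' = w'$. Assume $w_n \Rightarrow_R w_n'$. By symmetry, $w_n' \Rightarrow_R w_n$. Therefore,
$w_{n+1} = w_n w_n' \Rightarrow_R w_n' w_n = w_{n+1}'$ as required. □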

Definition 4.6. Suppose $\mathfrak{P} = (\Sigma, R, C)$ is a Parikh rewriting system. We say
that $\mathfrak{P}$ is sound iff $w \equiv_M w'$ for every $w, w' \in \Sigma^*$ such that $w \Rightarrow_{\mathfrak{P}} w'$. We say
that $\mathfrak{P}$ is complete iff $w \Rightarrow_{\mathfrak{P}} w'$ for every $w, w' \in \Sigma^*$ such that $w \equiv_M w'$.

Thus a Parikh rewriting system $\mathfrak{P}$ is sound and complete if and only if
$\Rightarrow_{\mathfrak{P}} = \equiv_M$.

Remark 4.7. If $(\Sigma, R, C)$ is a complete Parikh rewriting system, then $(\Sigma, R)$ is
a Parikh complete Thue system.

Example 4.8. Suppose $\Sigma = \{a < b\}$ and $R = \{(ab, ba)\}$. Then the Parikh rewriting
system $(\Sigma, R, \{ab\})$ is sound and complete. This was Lemma 4 in [9], which
appeared equivalently in the following form.

Suppose $\Sigma = \{a < b\}$ and $w, w' \in \Sigma^*$. Then $w \equiv_M w'$ if and only
if there exist $w_0, w_1, \ldots, w_n \in \Sigma^*$ such that $w = w_0 \to_R w_1 \to_R
w_2 \to_R \cdots \to_R w_n = w'$ and the number of applications of the
rewriting rule $ab \to ba$ equals the number of applications of the
rewriting rule $ba \to ab$.

Example 4.9. Suppose $\Sigma = \{a < b < c\}$. Let

$$R = \{(ac, ca)\} \cup \{(abxba, baxab) \mid x \in \Sigma^*\} \cup \{(bcxcb, cbxbc) \mid x \in \Sigma^*\}.$$

Then the Parikh rewriting system $(\Sigma, R, \{abc\})$ is sound and complete. This is
Theorem 9 in [13] and represents the first exhaustive solution to the injectivity
problem for the ternary alphabet as mentioned in the Introduction. However,
the rewriting rules do not preserve M-equivalence. This system of Salomaa will
be studied in Section 7.

Example 4.10. Suppose $\Sigma = \{a < b < c\}$ and $R = \{(ac, ca), (bc, cb), (ab, ba)\}$.
Apparently, $w \Rightarrow_R w'$ for every $w, w' \in \Sigma^*$ with $\Psi(w) = \Psi(w')$. Therefore, the
Parikh rewriting system $(\Sigma, R, \{ab, bc, abc\})$ is trivially sound and complete.

Example 4.10 suggests the following question: if finitely many rewriting rules
are allowed, is there a sound and complete Parikh rewriting system for the
ternary alphabet not involving all three counters?

Theorem 4.11. Suppose $\Sigma = \{a < b < c\}$ and $\mathfrak{P} = (\Sigma, R, C)$ is a sound Parikh
rewriting system. If $(\Sigma, R)$ is not Parikh sound, then $abc \in C$.

Proof. We argue by contradiction. Assume $abc \notin C$ and so $C \subseteq \{ab, bc\}$. Since
$(\Sigma, R)$ is not Parikh sound, by Proposition 3.3, choose $(w, w') \in R$ such that
$w \not\equiv_M w'$. Since every rewriting rule of a Parikh rewriting system preserves
the Parikh vector, it follows that $|w|_u \ne |w'|_u$ for some $u \in \{ab, bc, abc\}$. If
$|w|_{ab} = |w'|_{ab}$ and $|w|_{bc} = |w'|_{bc}$, then $w \Rightarrow_{\mathfrak{P}} w'$ but $w \not\equiv_M w'$, contradicting that $\mathfrak{P}$
is sound. Otherwise, either $|w|_{ab} \ne |w'|_{ab}$ or $|w|_{bc} \ne |w'|_{bc}$. Assume $|w|_{ab} \ne |w'|_{ab}$.
Clearly, $ww' \Rightarrow_R w'w$ and $wcw' \Rightarrow_R w'cw$ because $(w, w') \in R$. By Lemma 4.3,
$|ww'|_u = |w'w|_u$ for $u \in \{ab, bc\}$. Hence, $ww' \Rightarrow_{\mathfrak{P}} w'w$ and since $\mathfrak{P}$ is sound, it
follows that $|ww'|_{abc} = |w'w|_{abc}$. On the other hand, $|wcw'|_{ab} = |ww'|_{ab} = |w'w|_{ab} =
|w'cw|_{ab}$ while $|wcw'|_{bc} = |ww'|_{bc} + |w|_b = |w'w|_{bc} + |w'|_b = |w'cw|_{bc}$. Hence, $wcw' \Rightarrow_{\mathfrak{P}}
w'cw$ as well. However, $|wcw'|_{abc} = |ww'|_{abc} + |w|_{ab} \ne |w'w|_{abc} + |w'|_{ab} = |w'cw|_{abc}$,
contradicting that $\mathfrak{P}$ is sound. The other case is similar. □

Corollary 4.12. Suppose $\Sigma = \{a < b < c\}$ and $\mathfrak{P} = (\Sigma, R, C)$ is a sound and
complete Parikh rewriting system. If $R$ is finite, then $C = \{ab, bc, abc\}$.

Proof. Since $\mathfrak{P}$ is complete, by Remark 4.7, $(\Sigma, R)$ is a Parikh complete Thue
system. By Theorem 3.7, $(\Sigma, R)$ is not Parikh sound. Hence, by Theorem 4.11,
$abc \in C$.

Now, assume $ab \notin C$. Let $R' = \{(y, y') \in R \mid y, y' \in \{a, b\}^*\}$. Since $(\Sigma, R)$ is
Parikh complete, it can be easily deduced that $(\{a < b\}, R')$ is Parikh complete.
If $w, w' \in \{a, b\}^*$ and $w \Rightarrow_{R'} w'$, then $w \Rightarrow_{\mathfrak{P}} w'$ because $ab \notin C$. Since $\mathfrak{P}$ is
sound, it follows that $(\{a < b\}, R')$ is Parikh sound as well. By Theorem 3.7
again, $R'$ is infinite and so is $R$, a contradiction. Hence, $ab \in C$. Similarly, it
can be shown that $bc \in C$. □

5. Irreducible Transformations 

In this section, it will be shown that each transformation of a Parikh rewriting
system can be decomposed into a sequence of "indecomposable" transformations.
Before that, we need a particular notion of distance.³

Definition 5.1. Suppose $(\Sigma, R)$ is a Thue system, $w, w' \in \Sigma^*$, and $w \Rightarrow_R
w'$. The distance between $w$ and $w'$, denoted $\operatorname{dist}_R(w, w')$, is zero if $w = w'$;
otherwise, it is the least positive integer $n$ such that there exist $w_0, w_1, \ldots, w_n \in \Sigma^*$
satisfying $w = w_0 \to_R w_1 \to_R w_2 \to_R \cdots \to_R w_n = w'$.

³This is distinct from the notion of rank distance that has appeared in the literature.

The distance function $\operatorname{dist}_R$ is indeed a metric on each equivalence class $[w]_R$.
When $R$ is understood, we would simply write $\operatorname{dist}(w, w')$.

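Definition 5.2. Suppose $\mathfrak{P} = (\Sigma, R, C)$ is a Parikh rewriting system and
suppose $w$ and $w'$ are distinct words over $\Sigma$. We say that the transformation
$w \Rightarrow_{\mathfrak{P}} w'$ is reducible iff there exists $w'' \notin \{w, w'\}$ such that $w \Rightarrow_{\mathfrak{P}} w'' \Rightarrow_{\mathfrak{P}} w'$
and $\operatorname{dist}_R(w, w'') + \operatorname{dist}_R(w'', w') = \operatorname{dist}_R(w, w')$. Otherwise, we say that the
transformation is irreducible and denote this by $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$. The order of an
irreducible transformation $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$ is $n$ iff $\operatorname{dist}_R(w, w') = n$.

Remark 5.3. By definition, the identity transformations $w \Rightarrow_{\mathfrak{P}} w$ are not
regarded as irreducible transformations. Let

$$m = \min\{\operatorname{dist}(w, w') \mid w, w' \in \Sigma^*,\ w \Rightarrow_{\mathfrak{P}} w' \text{ and } w \ne w'\}.$$

By Proposition 4.5, $m$ is well-defined. For every distinct $w, w' \in \Sigma^*$, if $w \Rightarrow_{\mathfrak{P}} w'$
and $\operatorname{dist}(w, w') = m$, then $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$. In particular, if $w \Rightarrow_{\mathfrak{P}} w'$ and $\operatorname{dist}(w, w') = 1$,
then $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$.

It will be shown after Theorem 6.2 that $\overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}}$ and $\overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{Q}}$ need not be equal even
though $\mathfrak{P}$ and $\mathfrak{Q}$ are equivalent Parikh rewriting systems.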

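Theorem 5.4. Suppose $\mathfrak{P} = (\Sigma, R, C)$ is a Parikh rewriting system and $w$ and
$w'$ are distinct words over $\Sigma$. If $w \Rightarrow_{\mathfrak{P}} w'$, then there exist $w_0, w_1, \ldots, w_n \in \Sigma^*$
such that

$$w = w_0 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w_1 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w_2 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \cdots \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w_n = w'$$

and $\operatorname{dist}_R(w, w') = \sum_{i=1}^{n} \operatorname{dist}_R(w_{i-1}, w_i)$.

Proof. First of all, if $w \Rightarrow_{\mathfrak{P}} w'$ is irreducible, then the conclusion follows with
$n = 1$. We prove by induction on $\operatorname{dist}(w, w')$. If $\operatorname{dist}(w, w')$ equals one, then
$w \Rightarrow_{\mathfrak{P}} w'$ is irreducible and we are done. For the induction step, assume
$w \Rightarrow_{\mathfrak{P}} w'$ is reducible. Choose $w'' \notin \{w, w'\}$ such that $w \Rightarrow_{\mathfrak{P}} w'' \Rightarrow_{\mathfrak{P}} w'$ and
$\operatorname{dist}(w, w'') + \operatorname{dist}(w'', w') = \operatorname{dist}(w, w')$. Clearly, $\operatorname{dist}(w, w'') < \operatorname{dist}(w, w')$ and
$\operatorname{dist}(w'', w') < \operatorname{dist}(w, w')$. By the induction hypothesis, choose $u_0, u_1, \ldots, u_p \in
\Sigma^*$ such that $w = u_0 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} u_1 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} u_2 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \cdots \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} u_p = w''$ and $\operatorname{dist}(w, w'') =
\sum_{i=1}^{p} \operatorname{dist}(u_{i-1}, u_i)$. Similarly, choose $v_0, v_1, \ldots, v_q \in \Sigma^*$ such that $w'' = v_0 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}}
v_1 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} v_2 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \cdots \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} v_q = w'$ and $\operatorname{dist}(w'', w') = \sum_{j=1}^{q} \operatorname{dist}(v_{j-1}, v_j)$. Therefore,

$$w = u_0 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} u_1 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \cdots \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} u_p = v_0 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} v_1 \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \cdots \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} v_q = w'$$

and $\operatorname{dist}(w, w') = \sum_{i=1}^{p} \operatorname{dist}(u_{i-1}, u_i) + \sum_{j=1}^{q} \operatorname{dist}(v_{j-1}, v_j)$ as required. □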

It is not surprising that the sequence of irreducible transformations guaranteed
by Theorem 5.4 is not unique. However, the non-uniqueness of the length
of that sequence is not so clear and will be addressed later.

Theorem 5.5. Suppose $\mathfrak{P} = (\Sigma, R, C)$ is a sound and complete Parikh rewriting
system. Let $R' = \{(w, w') \mid w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'\}$. Then $(\Sigma, R')$ is a Parikh sound and
Parikh complete Thue system.

Proof. Suppose $w, w' \in \Sigma^*$. If $(w, w') \in R'$, then $w \Rightarrow_{\mathfrak{P}} w'$. Hence, $w \equiv_M w'$
because $\mathfrak{P}$ is sound. Thus $(\Sigma, R')$ is Parikh sound by Proposition 3.3. Conversely,
if $w \equiv_M w'$, then $w \Rightarrow_{\mathfrak{P}} w'$ because $\mathfrak{P}$ is complete. By Theorem 5.4, it follows
that $w \Rightarrow_{R'} w'$. Thus $(\Sigma, R')$ is Parikh complete. □

For some Parikh rewriting systems, the set of irreducible transformations
may have a simple explicit description (see the very first theorem in the next
section). Therefore, Theorem 5.5 provides a plausible means to obtain Parikh
sound and Parikh complete Thue systems.

6. The Binary Alphabet: a Case Study 

In this section we study some Parikh rewriting systems for the binary
alphabet. Suppose $\Sigma = \{a < b\}$ is a fixed ordered alphabet.

Theorem 6.1. Suppose $\mathfrak{P} = (\Sigma, \{(ab, ba)\}, \{ab\})$ and $(\Sigma, R')$ is the Thue system
where $R' = \{(abxba, baxab) \mid x \in \Sigma^*\}$. Then for every $w, w' \in \Sigma^*$,

$$w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w' \text{ if and only if } w \to_{R'} w'.$$

Proof. Let $R = \{(ab, ba)\}$. By our definition, neither $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w$ nor $w \to_{R'} w$ is
true for each $w \in \Sigma^*$. Suppose $w, w' \in \Sigma^*$ are distinct. First, we show that
$w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$ if and only if $w \Rightarrow_{\mathfrak{P}} w'$ and $\operatorname{dist}_R(w, w') = 2$. Clearly, if $w \Rightarrow_{\mathfrak{P}} w'$ and
$\operatorname{dist}_R(w, w') = 2$, then this transformation is irreducible by Remark 5.3.

Conversely, assume $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$ and so $w \Rightarrow_{\mathfrak{P}} w'$. Since $|w|_{ab} = |w'|_{ab}$, it
follows that $\operatorname{dist}_R(w, w')$ must be even because the number of applications
of the rewriting rule $ab \to ba$ must equal the number of applications of the
rewriting rule $ba \to ab$. Assume $\operatorname{dist}_R(w, w') = 2l$ for some $l > 1$. Suppose
$w = w_0 \to_R w_1 \to_R w_2 \to_R \cdots \to_R w_{2l} = w'$ for some $w_0, w_1, \ldots, w_{2l} \in \Sigma^*$. Note
that if both the rewriting rules $ab \to ba$ and $ba \to ab$ are involved in some two
consecutive direct transformations $w_i \to_R w_{i+1} \to_R w_{i+2}$, then the two rewriting
rules must be applied to non-overlapping positions; otherwise, $w_i = w_{i+2}$ and
$\operatorname{dist}_R(w, w')$ would have been less than $2l$. In this case, we may apply the rules
in reverse order and obtain $w_i \to_R w_{i+1}' \to_R w_{i+2}$ for some $w_{i+1}' \in \Sigma^*$. Therefore,
commuting the applications of the rules $ab \to ba$ and $ba \to ab$ as many
times as necessary, we may assume that both rewriting rules are involved in the
first two direct transformations $w_0 \to_R w_1 \to_R w_2$. However, this implies that
$w \Rightarrow_{\mathfrak{P}} w_2 \Rightarrow_{\mathfrak{P}} w'$ and $\operatorname{dist}_R(w, w') = \operatorname{dist}_R(w, w_2) + \operatorname{dist}_R(w_2, w')$. Therefore,
$w \Rightarrow_{\mathfrak{P}} w'$ is reducible, a contradiction.

By now it should be clear that $w \to_{R'} w'$ if and only if $w \Rightarrow_{\mathfrak{P}} w'$ and
$\operatorname{dist}_R(w, w') = 2$. Therefore, $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$ if and only if $w \to_{R'} w'$ as required. □

Since $\mathfrak{P} = (\Sigma, \{(ab, ba)\}, \{ab\})$ is a sound and complete Parikh rewriting
system (see Example 4.8), Theorem 5.5 and Theorem 6.1 together provide an
alternative proof of the fact stated in Example 3.5. Furthermore, the proof of
Theorem 6.1 shows that every irreducible transformation of $\mathfrak{P}$ has order two.

Although $(\Sigma, \{(ab, ba)\}, \{ab\})$ is the simplest and most natural Parikh rewriting
system for the binary alphabet, it is intriguing to ask what other combinations of
rewriting rules, not including the rule $ab \to ba$, may lead to.
In view of Theorem 5.5, our pursuit is accompanied by hope for a discovery of
a new Parikh sound and Parikh complete Thue system for the binary alphabet.

For the remainder of this section, fix $R_1 = \{(abb, bab), (bab, bba), (bba, abb)\}$
and $R_2 = \{(baa, aba), (aba, aab), (aab, baa)\}$.

For each $i = 1, 2$, it is easy to verify that $abba \not\Rightarrow_{R_i} baab$. Since $abba \equiv_M baab$,
it follows that neither $(\Sigma, R_1)$ nor $(\Sigma, R_2)$ is a Parikh complete Thue system.
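
The claim that $abba$ cannot be transformed into $baab$ under $R_1$ alone or $R_2$ alone can be confirmed by a short search. The Python sketch below is only an illustration; `neighbours` and `reachable` are names introduced here, and each pair in a rule set is applied in both directions as in Definition 3.1.

```python
from collections import deque

R1 = [("abb", "bab"), ("bab", "bba"), ("bba", "abb")]
R2 = [("baa", "aba"), ("aba", "aab"), ("aab", "baa")]


def neighbours(w, pairs):
    """Words obtained from w by one application of a rule induced by `pairs`."""
    out = set()
    for y, y_prime in pairs:
        for lhs, rhs in ((y, y_prime), (y_prime, y)):
            start = w.find(lhs)
            while start != -1:  # try every occurrence of lhs as a factor
                out.add(w[:start] + rhs + w[start + len(lhs):])
                start = w.find(lhs, start + 1)
    return out


def reachable(w, pairs):
    """The equivalence class of w under the Thue system with rule set `pairs`."""
    seen, queue = {w}, deque([w])
    while queue:
        for v in neighbours(queue.popleft(), pairs):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


print("baab" in reachable("abba", R1))       # False
print("baab" in reachable("abba", R2))       # False
print("baab" in reachable("abba", R1 + R2))  # True
```

The reason is visible by hand as well: $baab$ has no factor containing two $b$'s and one $a$, so no rule of $R_1$ applies to it, and $abba$ has no factor containing two $a$'s and one $b$, so no rule of $R_2$ applies to it.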

Theorem 6.2. If $R \subseteq R_1 \cup R_2$, $R \cap R_i \ne \emptyset$ for each $i = 1, 2$, and $|R \cap R_i| \ge 2$ for
some $i = 1, 2$, then $\mathfrak{P} = (\Sigma, R, \{ab\})$ is a sound and complete Parikh rewriting
system.

Proof. Suppose $w, w' \in \Sigma^*$. We need to show $w \Rightarrow_{\mathfrak{P}} w'$ if and only if $w \equiv_M w'$.
If $w \Rightarrow_{\mathfrak{P}} w'$, then $|w|_{ab} = |w'|_{ab}$. Since every rewriting rule of $\mathfrak{P}$ preserves the
Parikh vector, it follows that $w \equiv_M w'$. Conversely, assume $w \equiv_M w'$. To see that
$w \Rightarrow_{\mathfrak{P}} w'$, since $(\Sigma, R')$ is Parikh complete, where $R' = \{(abxba, baxab) \mid x \in \Sigma^*\}$,
it suffices to show that $v \Rightarrow_{\mathfrak{P}} v'$ whenever $v \to_{R'} v'$.

Therefore, it remains to show that $abwba \Rightarrow_{\mathfrak{P}} bawab$ for every $w \in \Sigma^*$. Fix
arbitrary $w \in \Sigma^*$. Since $|abwba|_{ab} = |bawab|_{ab}$, it suffices to show that $abwba \Rightarrow_R
bawab$. We may assume $|R \cap R_1| \ge 2$ as the other case mirrors this. Furthermore,
any rewriting rule induced by an ordered pair in $R_1$ produces the same effect
as a combination of rewriting rules induced by the other two ordered pairs in
$R_1$. Hence, without loss of generality, we may further assume that $R_1 \subseteq R$.

Case 1: $R \supseteq R_1 \cup \{(aba, aab)\}$.

Note that $bawba \to_R bawab$: if $w = \lambda$ or $w[|w|] = a$, apply the (rewriting)
rule $aba \to aab$; otherwise, apply the rule $bba \to bab$. Hence, it
remains to show that $abwba \Rightarrow_R bawba$. Now, if $w = \lambda$ or $w[1] = b$,
then $abwba \to_R bawba$, applying the rule $abb \to bab$. Otherwise, if
$w[1] = a$, write $wb$ as $aw'bw''$ for some $w' \in a^*$ and $w'' \in \Sigma^*$. Using
the rule $aab \to aba$ as many times as necessary (possibly none), it
follows that $abwba = abaw'bw''a \Rightarrow_R ababw'w''a$. Then $ababw'w''a \to_R
abbaw'w''a \to_R babaw'w''a$, using the rule $bab \to bba$, followed by the rule
$abb \to bab$. Finally, using the rule $aba \to aab$ as many times as necessary,
it follows that $babaw'w''a \Rightarrow_R baaw'bw''a = bawba$.

Case 2: $R \supseteq R_1 \cup \{(aab, baa)\}$.

First, we show that $abwba \Rightarrow_R bawba$. If $w = \lambda$ or $w[1] = b$, then apply
the rule $abb \to bab$. Otherwise, if $w[1] = a$, write $wb$ as $aw'bw''$ for
some $w' \in a^*$ and $w'' \in \Sigma^*$. Using the rule $aab \to baa$ as many times as
necessary and then possibly the rule $bab \to bba$ once (if $|w'|$ is even), it
follows that $abwba = abaw'bw''a \Rightarrow_R abbaw'w''a$. Then $abbaw'w''a \to_R
bbaaw'w''a$, applying the rule $abb \to bba$. Using possibly the rule $bba \to
bab$ once (if $|w'|$ is odd) and then the rule $baa \to aab$ as many times as
necessary, it follows that $bbaaw'w''a \Rightarrow_R baaw'bw''a = bawba$.

Similarly, it can be shown that $bawba \Rightarrow_R bawab$. If $w[|w|] = b$, then
$bawba \to_R bawab$, applying the rule $bba \to bab$. Otherwise, if $w = \lambda$
or $w[|w|] = a$, then write $baw$ as $w'baw''$ for some $w' \in \Sigma^*$ and $w'' \in
a^*$. Using the rule $aab \to baa$ as many times as necessary and then
possibly the rule $bab \to bba$ once (if $|w''|$ is even), it follows that $bawba =
w'baw''ba \Rightarrow_R w'bbaw''a$. Then using possibly the rule $bba \to bab$ once
(if $|w''|$ is odd) and then the rule $baa \to aab$ as many times as necessary,
it follows that $w'bbaw''a \Rightarrow_R w'baw''ab = bawab$.

Case 3: $R \supseteq R_1 \cup \{(baa, aba)\}$.

This case is similar to Case 1. □

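Remark 6.3. If $|R \cap R_i| = 1$ for both $i = 1, 2$, then $(\Sigma, R)$ is not a Parikh complete
Thue system. Hence, $(\Sigma, R, \{ab\})$ is not a complete Parikh rewriting system.

Suppose $\mathfrak{P} = (\Sigma, R, \{ab\})$, where $R = R_1 \cup R_2$, and $\mathfrak{Q} = (\Sigma, \{(ab, ba)\}, \{ab\})$.
Then $\mathfrak{P}$ and $\mathfrak{Q}$ are equivalent as they are both sound and complete. However,
we will show that $\overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} \ne \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{Q}}$. Clearly,

$$w = bbaaabaab \to_R abbaabaab \to_R abbabaaab \to_R abbabaaba = w'.$$

Since $w$ and $w'$ differ in six positions and every
rewriting rule of $\mathfrak{P}$ affects exactly two positions, it follows that $\operatorname{dist}_R(w, w') = 3$.
Hence, $w \Rightarrow_{\mathfrak{P}} w'$ must be irreducible because no (irreducible) transformation of
$\mathfrak{P}$ has order one. However, by Theorem 6.1, $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{Q}} w'$ does not hold.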
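
The distance computation in this example can be reproduced by breadth-first search. The following Python sketch is a hedged illustration rather than part of the argument; `neighbours`, `dist` and `count_ab` are our own helper names, and the rule set is $R = R_1 \cup R_2$ as above.

```python
from collections import deque

R = [("abb", "bab"), ("bab", "bba"), ("bba", "abb"),
     ("baa", "aba"), ("aba", "aab"), ("aab", "baa")]


def neighbours(w):
    """Words obtained from w by one rule induced by R (both directions)."""
    out = set()
    for y, y_prime in R:
        for lhs, rhs in ((y, y_prime), (y_prime, y)):
            start = w.find(lhs)
            while start != -1:
                out.add(w[:start] + rhs + w[start + len(lhs):])
                start = w.find(lhs, start + 1)
    return out


def dist(w, target):
    """dist_R(w, target) by breadth-first search, or None if target is unreachable."""
    depth, queue = {w: 0}, deque([w])
    while queue:
        v = queue.popleft()
        if v == target:
            return depth[v]
        for x in neighbours(v):
            if x not in depth:
                depth[x] = depth[v] + 1
                queue.append(x)
    return None


def count_ab(w):
    """|w|_ab: for every b, count the a's to its left."""
    total = seen_a = 0
    for letter in w:
        if letter == "a":
            seen_a += 1
        else:
            total += seen_a
    return total


w, w_prime = "bbaaabaab", "abbabaaba"
print(dist(w, w_prime))                     # 3
print(count_ab(w) == count_ab(w_prime))     # True, so w transforms into w' in the system
print(all(count_ab(v) != count_ab(w) for v in neighbours(w)))  # True: no order-one step
```

The last line illustrates why no transformation of this system has order one: every single rule application changes the value of the counter $ab$.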

Although $w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'$ has order three, $w$ can be transformed into $w'$ via a
sequence of irreducible transformations of order two, namely

$$w = bbaaabaab \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} abbbaaaab \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} abbabaaba = w'.$$

In fact, since $(\Sigma, R')$ is Parikh complete, where $R' = \{(abxba, baxab) \mid x \in \Sigma^*\}$,
and every direct transformation $w \to_{R'} w'$ is an irreducible transformation of $\mathfrak{P}$
of order two, it implies that every non-identity transformation of $\mathfrak{P}$ can be
expressed as a sequence of irreducible transformations of order two. Therefore, it
raises the question whether the irreducible transformations of order at most two
of a complete Parikh rewriting system are sufficient to form a Parikh complete
Thue system. This will be answered in the next section.


7. Salomaa’s Parikh Rewriting System 

For this section, suppose $\Sigma = \{a < b < c\}$ is a fixed ordered alphabet. Let
$\mathfrak{P} = (\Sigma, R, \{abc\})$ be a fixed Parikh rewriting system, where

$$R = \{(ac, ca)\} \cup \{(abxba, baxab) \mid x \in \Sigma^*\} \cup \{(bcxcb, cbxbc) \mid x \in \Sigma^*\}.$$

Recall that $\mathfrak{P}$ is sound and complete.

Since $abc$ is the unique counter for $\mathfrak{P}$, when we write $w \xrightarrow{\delta}_R w'$, the number $\delta$
above $\to_R$ indicates the corresponding change in the number of occurrences of
the subword $abc$, that is, $|w'|_{abc} = |w|_{abc} + \delta$. Clearly, if
$w = w_0 \xrightarrow{\delta_1}_R w_1 \xrightarrow{\delta_2}_R w_2 \xrightarrow{\delta_3}_R \cdots \xrightarrow{\delta_n}_R w_n = w'$
and $w \Rightarrow_{\mathfrak{P}} w'$ (so $|w|_{abc} = |w'|_{abc}$), then $\sum_{i=1}^{n} \delta_i$ must be zero.

Remark 7.1. Note that each rewriting rule of $\mathfrak{P}$ affects at most four letters.
Hence, if $w, w' \in \Sigma^*$ and $\operatorname{dist}(w, w') = n$, then $w, w'$ can differ in at most $4n$
positions. Conversely, if $w, w'$ differ in $4n$ positions and $w = w_0 \to_R w_1 \to_R
w_2 \to_R \cdots \to_R w_n = w'$ for some $w_0, w_1, \ldots, w_n \in \Sigma^*$, then $\operatorname{dist}(w, w') = n$.

Example 7.2. Consider the following sequence of direct transformations.

$$\begin{aligned}
w = abcbabacababcbabacab &\xrightarrow{+3}_R abcabbacababcbabacba \\
&\xrightarrow{-1}_R bacababcababcbabacba \\
&\xrightarrow{-1}_R bacababcbaabcabbacba \\
&\xrightarrow{-1}_R bacababcbabacababcba = w'
\end{aligned}$$

By Remark 7.1, $\operatorname{dist}(w, w') = 4$ as $w$ and $w'$ differ in 16 positions. Note that
no proper combination of the values above the arrows sums to zero. However,
$w \Rightarrow_{\mathfrak{P}} w'$ is reducible because

$$w = abcbabacababcbabacab \Rightarrow_{\mathfrak{P}} bacababcbaabcbabacab = w'' \Rightarrow_{\mathfrak{P}} bacababcbabacababcba = w'$$

and $\operatorname{dist}(w, w'') + \operatorname{dist}(w'', w') = 2 + 2 = \operatorname{dist}(w, w')$.
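
The counter changes in Example 7.2 can be recomputed directly from the definition of $|w|_{abc}$. The Python sketch below is illustrative; `count_subword` and `differing_positions` are helper names of our own, and the words are copied from the example.

```python
def count_subword(w, u):
    """|w|_u by dynamic programming over prefixes of u."""
    dp = [1] + [0] * len(u)
    for letter in w:
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


def differing_positions(w, v):
    """Number of positions at which two words of equal length differ."""
    return sum(x != y for x, y in zip(w, v))


steps = [
    "abcbabacababcbabacab",
    "abcabbacababcbabacba",
    "bacababcababcbabacba",
    "bacababcbaabcabbacba",
    "bacababcbabacababcba",
]
deltas = [count_subword(v, "abc") - count_subword(u, "abc")
          for u, v in zip(steps, steps[1:])]
print(deltas)                                     # [3, -1, -1, -1]
print(differing_positions(steps[0], steps[-1]))   # 16

middle = "bacababcbaabcbabacab"                   # the word w'' of the example
print(count_subword(middle, "abc") == count_subword(steps[0], "abc"))  # True
print(differing_positions(steps[0], middle), differing_positions(middle, steps[-1]))  # 8 8
```

Since $w$ and $w''$ differ in eight positions, as do $w''$ and $w'$, Remark 7.1 gives distance two for each half once a two-step path is exhibited.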

The following technical lemma will be useful in determining $\operatorname{dist}_R(w, w')$ in
more complicated scenarios.

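Lemma 7.3. Suppose $R_1 = \{(abxba, baxab) \mid x \in \Sigma^*\}$, $R_2 = \{(bcxcb, cbxbc) \mid
x \in \Sigma^*\}$, and $R_3 = \{(ac, ca)\}$. If $w, w' \in \Sigma^*$ and $w \Rightarrow_R w'$, then

(1) $\pi_{a,b}(w) \Rightarrow_{R_1} \pi_{a,b}(w')$, $\pi_{b,c}(w) \Rightarrow_{R_2} \pi_{b,c}(w')$, and $\pi_{a,c}(w) \Rightarrow_{R_3} \pi_{a,c}(w')$;

(2) $\operatorname{dist}_R(w, w') \ge \operatorname{dist}_{R_1}(\pi_{a,b}(w), \pi_{a,b}(w')) + \operatorname{dist}_{R_2}(\pi_{b,c}(w), \pi_{b,c}(w'))
+ \operatorname{dist}_{R_3}(\pi_{a,c}(w), \pi_{a,c}(w'))$.

Proof. Suppose $w = w_0 \to_R w_1 \to_R w_2 \to_R \cdots \to_R w_{\operatorname{dist}_R(w,w')} = w'$ for some
$w_0, w_1, \ldots, w_{\operatorname{dist}_R(w,w')} \in \Sigma^*$. Notice that exactly the rewriting rules induced
by elements of $R_1$ would affect the relative positions of the $a$'s and $b$'s in $w$.
Suppose $i_1, i_2, \ldots, i_p$ is an increasing enumeration of all the $i$'s such that the
direct transformation $w_{i-1} \to_R w_i$ is carried out using a rewriting rule induced
by an element of $R_1$. If no such $i$ exists, then $\pi_{a,b}(w) = \pi_{a,b}(w')$ and by definition
$\pi_{a,b}(w) \Rightarrow_{R_1} \pi_{a,b}(w')$. Otherwise, it should be clear that

$$\pi_{a,b}(w) \to_{R_1} \pi_{a,b}(w_{i_1}) \to_{R_1} \pi_{a,b}(w_{i_2}) \to_{R_1} \cdots \to_{R_1} \pi_{a,b}(w_{i_p}) = \pi_{a,b}(w').$$

Therefore, $\pi_{a,b}(w) \Rightarrow_{R_1} \pi_{a,b}(w')$ and $p \ge \operatorname{dist}_{R_1}(\pi_{a,b}(w), \pi_{a,b}(w'))$. Similarly,
$\pi_{b,c}(w) \Rightarrow_{R_2} \pi_{b,c}(w')$ and $\pi_{a,c}(w) \Rightarrow_{R_3} \pi_{a,c}(w')$. The second part follows from
the proof of the first part as $R_1$, $R_2$, and $R_3$ are disjoint. □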

Remark 7.4. The conclusion stated in Lemma 7.3 (2) cannot be strengthened
to equality, not even assuming $w \equiv_M w'$. For example, let $w = bcacabcabbca$
and $w' = cabbcabcacab$. Note that $w \Rightarrow_R w'$ and $w \equiv_M w'$. However, only the
rewriting rules $ac \to ca$ or $ca \to ac$ can be applied on $w$ but $\pi_{a,c}(w) = \pi_{a,c}(w') =
cacacaca$. Therefore, from the proof of Lemma 7.3, we see that $\operatorname{dist}_R(w, w')$ is
strictly greater than the sum of the corresponding distances on the right.
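
The two facts used in Remark 7.4 about this pair of words, namely equal projections onto $\{a, c\}$ and M-equivalence, can be checked with a few lines of Python. This is a sketch with helper names of our own (`project`, `count_subword`), not code from the paper.

```python
def project(w, gamma):
    """The projective morphism pi_Gamma: erase every letter of w outside gamma."""
    return "".join(letter for letter in w if letter in gamma)


def count_subword(w, u):
    """|w|_u by dynamic programming over prefixes of u."""
    dp = [1] + [0] * len(u)
    for letter in w:
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


w, w_prime = "bcacabcabbca", "cabbcabcacab"
print(project(w, "ac"), project(w_prime, "ac"))  # cacacaca cacacaca
# By Theorem 2.3, M-equivalence over {a < b < c} amounts to equality of these six counts.
print(all(count_subword(w, u) == count_subword(w_prime, u)
          for u in ("a", "b", "c", "ab", "bc", "abc")))  # True
```

Because the projections onto $\{a, c\}$ coincide, the $R_3$ term in Lemma 7.3 (2) is zero, even though any transformation starting from $w$ must begin with a rule induced by $R_3$.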

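Example 7.5. Since $w = abbcacb \xrightarrow{+1}_R abcbabc \xrightarrow{-1}_R bacabbc = w'$, the transformation
$w \Rightarrow_{\mathfrak{P}} w'$ is irreducible with order 2. In this case, the two words happen
to be ME-equivalent, as

$$w = abbcacb \to_R abbaccb \to_R baabccb \to_R baacbbc = w'' \to_R bacabbc = w'.$$

Using this, we can address the uniqueness issue that was raised after
Theorem 5.4. By Lemma 7.3, $\operatorname{dist}(w, w'') \ge 3$ and hence it must be three. However,
$w \Rightarrow_{\mathfrak{P}} w''$ can be expressed as two sequences of irreducible transformations
(of distinct length) satisfying the conclusion of Theorem 5.4, namely,

$$w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w' \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w'' \quad\text{and}\quad w \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} abbaccb \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} baabccb \overset{\mathrm{irr}}{\Rightarrow}_{\mathfrak{P}} w''.$$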


Theorem 7.6. For every positive integer $n$, there exist $w, w' \in \Sigma^*$ such that
$w \Rightarrow_{\mathfrak{P}} w'$ is irreducible with order $n$.

Proof. The transformation $ac \Rightarrow_{\mathfrak{P}} ca$ is irreducible with order one. For every
positive integer $n$, consider the following sequence of direct transformations:

$$\begin{aligned}
w = a^n bcb a^n a c^n ab &\xrightarrow{-1}_R a^{n-1} bacab a^{n-1} a c^n ab \xrightarrow{-1}_R \cdots \\
&\xrightarrow{-1}_R b a^n c a^n b a c^n ab \xrightarrow{+n}_R b a^n c a^n a b c^n ba = w'.
\end{aligned}$$

Since the number of $a$ between the first two $b$ in $w$ is zero while that number
is $2n + 1$ in $w'$, it follows that $\operatorname{dist}(w, w') = n + 1$. In fact, if $w = w_0 \to_R w_1 \to_R
\cdots \to_R w_{n+1} = w'$ for some $w_0, w_1, \ldots, w_{n+1} \in \Sigma^*$, then

(1) each except one of the direct transformations applies the rewriting rule
of the form $abxba \to baxab$ for some $x \in \Sigma^*$ such that the first and the
second $b$ in the corresponding word are affected;

(2) exactly one of the direct transformations applies the rewriting rule of
the form $baxab \to abxba$ for some $x \in \Sigma^*$ such that the second and the
third $b$ in the corresponding word are affected.

This means that every $\delta_i$ is $-1$ except one that is $+n$. Assume $w \Rightarrow_{\mathfrak{P}} w'$
is reducible. Then choose $w'' \notin \{w, w'\}$ such that $w \Rightarrow_{\mathfrak{P}} w'' \Rightarrow_{\mathfrak{P}} w'$ and
$\operatorname{dist}(w, w'') + \operatorname{dist}(w'', w') = n + 1$. Let $d = \operatorname{dist}(w, w'')$. Then

$$w = w_0 \xrightarrow{\delta_1}_R w_1 \xrightarrow{\delta_2}_R \cdots \xrightarrow{\delta_d}_R w_d = w'' \xrightarrow{\delta_{d+1}}_R \cdots \xrightarrow{\delta_{n+1}}_R w_{n+1} = w'$$

for some $w_0, w_1, \ldots, w_{n+1} \in \Sigma^*$ such that $\delta_1 + \delta_2 + \cdots + \delta_d = 0$. However, by the
observation above, $\delta_1 + \delta_2 + \cdots + \delta_d$ is either $-d$ or $n + 1 - d$, a contradiction. □
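
The family of words used in this proof can be generated and inspected for small $n$. The Python sketch below is an illustration under our reading of the displayed sequence; `family` and `count_subword` are names introduced here. It lists the $n + 2$ words of the sequence and the change in $|\cdot|_{abc}$ at each step.

```python
def count_subword(w, u):
    """|w|_u by dynamic programming over prefixes of u."""
    dp = [1] + [0] * len(u)
    for letter in w:
        for k in range(len(u), 0, -1):
            if u[k - 1] == letter:
                dp[k] += dp[k - 1]
    return dp[len(u)]


def family(n):
    """The words w = w_0, w_1, ..., w_{n+1} = w' from the proof of Theorem 7.6."""
    tail = "a" + "c" * n + "ab"
    # After k steps the first two b's have each moved outwards past k letters a.
    words = ["a" * (n - k) + "b" + "a" * k + "c" + "a" * k + "b" + "a" * (n - k) + tail
             for k in range(n + 1)]
    # Last step: the rule baxab -> abxba with x = c^n, applied to the second and third b.
    words.append("b" + "a" * n + "c" + "a" * n + "a" + "b" + "c" * n + "ba")
    return words


for n in (1, 2, 3):
    words = family(n)
    deltas = [count_subword(v, "abc") - count_subword(u, "abc")
              for u, v in zip(words, words[1:])]
    print(n, words[0], words[-1], deltas)
```

For $n = 2$ this prints the pair $aabcbaaaccab$, $baacaaabccba$ with changes $-1, -1, +2$; every proper initial segment of the list of changes has a nonzero sum, which is the observation the proof relies on.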

Although Theorem 7.6 shows the existence of irreducible transformations of
arbitrarily large order, as pointed out at the end of Section 6, it remains to be
seen whether irreducible transformations of large order are indeed indispensable.
Our next theorem partially answers this question positively.


Theorem 7.7. Irreducible transformations of $\mathfrak{P}$ of order three that cannot be expressed as a sequence of irreducible transformations of order at most two exist.

Proof. (The readers may wish to verify the result on their own, perhaps aided 
by a computer, rather than following our detailed case by case proof here.) 
By the proof of Theorem 7.6, $aabcbaaaccab \Rightarrow_{\mathfrak{P}} baacaaabccba$ is an irreducible transformation of order three. We claim that it cannot be expressed as a sequence of irreducible transformations of order at most two. First of all, suppose $w$ is obtained from $aabcbaaaccab$ using a sequence of irreducible transformations of order one. Then it should be clear that $w$ must be $aabcbub$ for some $u \in \Sigma^*$ such that $|u|_a = 4$, $|u|_b = 0$, and $|u|_c = 2$. For easier visualization and brevity, we simply represent this by

$$w = aabcb \underbrace{u}_{4a,2c} b.$$

It suffices to show that no irreducible transformation of order two can be applied to $w$. We argue by contradiction. Assume $w \Rightarrow_{\mathfrak{P}} w'$ is irreducible with order two for some $w' \in \Sigma^*$. Then for some $w'' \notin \{w,w'\}$,

$$w \xrightarrow{p}_R w'' \xrightarrow{q}_R w', \quad\text{where } 0 \neq p = -q.$$


For the purpose of referencing, let us refer to the rewriting rule employed by $w \to_R w''$ (respectively $w'' \to_R w'$) as Rule I (respectively Rule II). Note that if Rule I is either $abxba \to baxab$ or $cbxbc \to bcxcb$ for some $x \in \Sigma^*$, then Rule II must be either $bayab \to abyba$ or $bcycb \to cbybc$ for some $y \in \Sigma^*$, and vice versa.


Case 1. Rule I is $abxba \to baxab$ for some $x \in \Sigma^*$.

Then $w = aabcba \underbrace{v}_{3a,2c} b \xrightarrow{-1}_R abacab \underbrace{v}_{3a,2c} b = w''$. Suppose Rule II is $bcycb \to cbybc$ for some $y \in \Sigma^*$. Then $w'' = abacabcaaacb \xrightarrow{+3}_R abacacbaaabc = w'$. However, $p + q = -1 + 3 \neq 0$, a contradiction. Therefore, Rule II must be $bayab \to abyba$ for some $y \in \Sigma^*$. There are two possible applications of this rule on $w''$ that do not revert $w''$ back to $w$. Since $|v|_c = 2$, these two applications would increase the counter by at least two. Hence, again $p + q \neq 0$.

Case 2. Rule I is $cbxbc \to bcxcb$ for some $x \in \Sigma^*$.

This case cannot happen as this rule cannot be applied on $w$.

Case 3. Rule I is $baxab \to abxba$ for some $x \in \Sigma^*$.

Then $w = aabcba \underbrace{v}_{2a,2c} ab \xrightarrow{+2}_R aabcab \underbrace{v}_{2a,2c} ba = w''$. Then Rule II must be $abyba \to bayab$ for some $y \in \Sigma^*$, as the rule $cbybc \to bcycb$ for any $y \in \Sigma^*$ cannot be applied on $w''$. There are two possible applications of this rule on $w''$ that do not revert $w''$ back to $w$. These two applications would decrease the counter by 1 and 3 respectively. Hence, again $p + q \neq 0$.

Case 4. Rule I is $bcxcb \to cbxbc$ for some $x \in \Sigma^*$.

There are two possible distinct applications of Rule I on $w$:

(i) $w = aabcb \underbrace{v}_{4a,1c} cb \xrightarrow{+4}_R aacbb \underbrace{v}_{4a,1c} bc = w''$;

(ii) $w = aabcbcaaaacb \xrightarrow{+4}_R aabccbaaaabc = w''$.

If $w'' = aacbb \underbrace{v}_{4a,1c} bc$, then Rule II must be $cbybc \to bcycb$ for some $y \in \Sigma^*$, as the rule $abyba \to bayab$ for any $y \in \Sigma^*$ cannot be applied on $w''$. However, this rule cannot account for $w'' \xrightarrow{-4}_R w'$, unless $w' = w$, which is not the case. On the other hand, if $w'' = aabccbaaaabc$, then Rule II must be $abyba \to bayab$ for some $y \in \Sigma^*$ to avoid $w''$ being reverted back to $w$. Thus $w'' = aabccbaaaabc \xrightarrow{-2}_R abaccabaaabc = w'$. Nevertheless, again $p + q \neq 0$. □
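For readers who prefer the computer check suggested at the start of the proof, the following is a minimal sketch of one way to do it. It rests on several assumptions that are not spelled out in this section: the rewriting rules of the system are taken to be $ac \leftrightarrow ca$, $abxba \leftrightarrow baxab$ and $bcxcb \leftrightarrow cbxbc$ for arbitrary $x$; the order of a transformation is taken to be the minimal number of direct transformations realizing it; and M-equivalence over the ordered alphabet $a < b < c$ is tested through the six subword counts that make up the Parikh matrix. Under these assumptions, the words reachable by irreducible transformations of order at most two are exactly those reachable by repeatedly moving to an M-equivalent word at most two direct transformations away. All function names are hypothetical.

```python
def subword_count(w, u):
    """Number of occurrences of u as a scattered subword of w."""
    ways = [1] + [0] * len(u)
    for ch in w:
        for j in range(len(u), 0, -1):
            if u[j - 1] == ch:
                ways[j] += ways[j - 1]
    return ways[-1]


def parikh_data(w):
    """Entries above the diagonal of the Parikh matrix (assumes a < b < c)."""
    return tuple(subword_count(w, u) for u in ("a", "b", "c", "ab", "bc", "abc"))


def direct_steps(w):
    """All words obtained from w by one application of an assumed rule."""
    out = set()
    # ac <-> ca on adjacent letters
    for i in range(len(w) - 1):
        if w[i:i + 2] in ("ac", "ca"):
            out.add(w[:i] + w[i + 1] + w[i] + w[i + 2:])
    # abxba <-> baxab and bcxcb <-> cbxbc, with x possibly empty
    for left in ("ab", "ba", "bc", "cb"):
        right = left[::-1]
        for i in range(len(w) - 3):
            if w[i:i + 2] != left:
                continue
            for j in range(i + 2, len(w) - 1):
                if w[j:j + 2] == right:
                    out.add(w[:i] + right + w[i + 2:j] + left + w[j + 2:])
    return out


def closure_order_at_most_two(start):
    """Words reachable via M-equivalent words at most two direct steps apart."""
    seen, stack = {start}, [start]
    while stack:
        w = stack.pop()
        target = parikh_data(w)
        one = direct_steps(w)
        two = set()
        for v in one:
            two |= direct_steps(v)
        for v in one | two:
            if v not in seen and parikh_data(v) == target:
                seen.add(v)
                stack.append(v)
    return seen


source, goal = "aabcbaaaccab", "baacaaabccba"
reachable = closure_order_at_most_two(source)

if __name__ == "__main__":
    print(len(reachable), goal in reachable,
          parikh_data(source) == parikh_data(goal))
```

With these modelling choices the search should stay inside the 15 words of the form $aabcb\,u\,b$ described in the proof and never reach $baacaaabccba$, even though the two words have the same Parikh data.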
It should be pointed out that irreducible transformations of $\mathfrak{P}$ of order three that can be expressed as a sequence of irreducible transformations of order at most two do exist. Without going through the similar tedious arguments, we claim that $abcbcbacab \Rightarrow_{\mathfrak{P}} bacabcbcba$ is irreducible with order three. On the other hand, $abcbcbacab \Rightarrow_{\mathfrak{P}} abcbcbaacb \Rightarrow_{\mathfrak{P}} baccbababc \Rightarrow_{\mathfrak{P}} baccabbbac \Rightarrow_{\mathfrak{P}} bacacbbbac \Rightarrow_{\mathfrak{P}} bacacbbbca \Rightarrow_{\mathfrak{P}} bacabcbcba$ is a sequence of irreducible transformations of order at most two.
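As an illustration of how an order-two link in such a sequence can arise, the step from $abcbcbaacb$ to $baccbababc$ may be realized by two direct transformations whose counters cancel. This is a worked sketch only: it assumes that $abxba \to baxab$ carries counter $-|x|_c$ and that $bcxcb \to cbxbc$ carries counter $+|x|_a$, which agrees with the counter values displayed in the proof of Theorem 7.7. The first step uses $x = cbc$ on the prefix $abcbcba$, and the second uses $x = aba$ on the factor $bcabacb$.

```latex
abcbcbaacb \xrightarrow{-2}_R bacbcabacb \xrightarrow{+2}_R baccbababc
```

The remaining links of the sequence are each a single rule application with counter zero: an exchange of adjacent $a$ and $c$, or a rule of the form $baxab \to abxba$ or $cbxbc \to bcxcb$ with $x = b$.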

8. Conclusions 

Parikh rewriting systems are feasible alternatives to Thue systems in the quest for characterization of M-equivalence. As highlighted by Theorem 5.5, the irreducible transformations of a sound and complete Parikh rewriting system lead to a Parikh sound and Parikh complete Thue system. It is conceivable that with an ingeniously chosen Parikh rewriting system with a decidable set of rewriting rules, the set of all irreducible transformations may have an effective description. While our immediate goal is the ternary alphabet, this approach may prove to be a universal way of generating Parikh sound and Parikh complete Thue systems for every alphabet.

For the ternary alphabet, the only Parikh rewriting system $\mathfrak{P}$ that we have analyzed in the previous section was chosen due to its appearance in the literature and its canonicalness. It is possible to give an exhaustive description of all its irreducible transformations of order two or three. However, this task is not carried out explicitly, as we believe that not every irreducible transformation of $\mathfrak{P}$ can be expressed as a sequence of irreducible transformations of order at most three anyway. As the proof of Theorem 7.7 is by a tedious case analysis, it cannot be adapted to analogous results for higher order. Hence, the following is left as a conjecture.

Conjecture 8.1. Irreducible transformations of $\mathfrak{P}$ of arbitrarily large order that cannot be expressed as a sequence of irreducible transformations of lower order exist.